Continuous Task-Specific Categories for Disambiguation: Putting Lexical Constraints (back) in the Lexicon
Abstract
Syntactic ambiguity resolution often appears to be guided by lexical constraint information. A traditional linguistic framework of discrete syntactic categories can complicate the modeling of such lexical constraints. There is evidence that syntactic ambiguity resolution is exposure-based, i.e. present analyses are influenced by past experiences (e.g. Mitchell and Cuetos (1991)). An important feature of such exposure-based models is that similar instances must be recognized in order to make the appropriate generalizations. We have developed a method to construct a continuous representation of lexical similarities, the Lexical Space (LexSpace), based on local (within several words to the right and left) distributional properties of words in a corpus. The distributional organization of LexSpace can be exploited for a task-specific syntactic categorization or clustering of words, with an arbitrary degree of granularity, according to the demands of a specific language processing task. In this paper we describe several models which perform a disambiguation task on the basis of learning from examples. We compare the use of syntactic representations of words from LexSpace with the task-specific generalizations inherent to the Example-based models. We consider two tasks, prepositional phrase attachment and direct object/sentence complement ambiguity resolution, and compare the generalization performance of the methods on unseen test material. The results suggest that Lexical Space provides rich lexical representations that can also be used for similarity-based analysis as required in exposure-based strategies. Moreover, the LexSpace representation can be used as input to connectionist models. The Lexical Space representations were found to work successfully, and the Example-based method generalizes to new test material with an accuracy (84.1%) equal to that of the best method known for this task in the literature (Collins and Brooks, 1995). However, these optimal results were obtained not with the Lexical Space method but with the Overlap IG metric.
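The abstract describes the Lexical Space only in outline: word representations derived from the words occurring a few positions to the left and right in a corpus. The following is a minimal sketch of that general idea, not the authors' construction. It assumes raw position-tagged co-occurrence counts and cosine similarity; the function names `context_vectors` and `cosine`, the window size, and the toy corpus are all hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict
from math import sqrt


def context_vectors(tokens, window=2):
    """Map each word to counts of (offset, neighbour) pairs within +/- window.

    Assumption: plain counts, with left and right positions kept distinct.
    The paper may weight or reduce these features differently.
    """
    vectors = defaultdict(Counter)
    for i, word in enumerate(tokens):
        for offset in range(-window, window + 1):
            j = i + offset
            if offset != 0 and 0 <= j < len(tokens):
                vectors[word][(offset, tokens[j])] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(count * v[key] for key, count in u.items() if key in v)
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Toy corpus, invented for this example only.
corpus = "the cat eats fish . the dog eats meat . the cat sees the dog .".split()
vectors = context_vectors(corpus, window=2)

sim_cat_dog = cosine(vectors["cat"], vectors["dog"])
sim_cat_eats = cosine(vectors["cat"], vectors["eats"])
print(f"cat~dog: {sim_cat_dog:.3f}  cat~eats: {sim_cat_eats:.3f}")
```

On this toy corpus the two nouns share left and right contexts and so come out more similar to each other than either is to the verb, which is the kind of graded, continuous similarity that an exposure-based model could use in place of discrete category labels.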